204 research outputs found
Removing noise from pyrosequenced amplicons
Background
In many environmental genomics applications a homologous region of DNA from a diverse sample is first amplified by PCR and then sequenced. The next generation sequencing technology, 454 pyrosequencing, has allowed much larger read numbers from PCR amplicons than ever before. This has revolutionised the study of microbial diversity as it is now possible to sequence a substantial fraction of the 16S rRNA genes in a community. However, there is a growing realisation that because of the large read numbers and the lack of consensus sequences it is vital to distinguish noise from true sequence diversity in this data. Otherwise this leads to inflated estimates of the number of types or operational taxonomic units (OTUs) present. Three sources of error are important: sequencing error, PCR single base substitutions and PCR chimeras. We present AmpliconNoise, a development of the PyroNoise algorithm that is capable of separately removing 454 sequencing errors and PCR single base errors. We also introduce a novel chimera removal program, Perseus, that exploits the sequence abundances associated with pyrosequencing data. We use data sets where samples of known diversity have been amplified and sequenced to quantify the effect of each of the sources of error on OTU inflation and to validate these algorithms
Making It Last: Storage Time and Temperature Have Differential Impacts on Metabolite Profiles of Airway Samples from Cystic Fibrosis Patients.
Metabolites of human or microbial origin have the potential to be important biomarkers of the disease state in cystic fibrosis (CF). Clinical sample collection and storage conditions may impact metabolite abundances with clinical relevance. We measured the change in metabolite composition based on untargeted gas chromatography-mass spectrometry (GC-MS) when CF sputum samples were stored at 4°C, -20°C, or -80°C with one or two freeze-thaw cycles. Daily measurements were taken for 1 week and then weekly for 4 weeks (4°C) and 8 weeks (-20°C). The metabolites in samples stored at -20°C maintained abundances similar to those found at-80°C over the course of 8 weeks (average change in Bray-Curtis distance, 0.06 ± 0.04) and were also stable after one or two freeze-thaw cycles. However, the metabolite profiles of samples stored at 4°C shifted after 1 day and continued to change over the course of 4 weeks (average change in Bray-Curtis distance, 0.31 ± 0.12). The abundances of several amino acids and other metabolites increased with time of storage at 4°C but remained constant at -20°C. Storage temperature was a significant factor driving the metabolite composition (permutational multivariate analysis of variance: r2 = 0.32 to 0.49, P < 0.001). CF sputum samples stored at -20°C at the time of sampling maintain a relatively stable untargeted GC-MS profile. Samples should be frozen on the day of collection, as more than 1 day at 4°C impacts the global composition of the metabolites in the sample. IMPORTANCE Metabolomics has great potential for uncovering biomarkers of the disease state in CF and many other contexts. However, sample storage timing and temperature may alter the abundance of clinically relevant metabolites. To assess whether existing samples are stable and to direct future study design, we conducted untargeted GC-MS metabolomic analysis of CF sputum samples after one or two freeze-thaw cycles and storage at 4°C and -20°C for 4 to 8 weeks. Overall, storage at -20°C and freeze-thaw cycles had little impact on metabolite profiles; however, storage at 4°C shifted metabolite abundances significantly. GC-MS profiling will aid in our understanding of the CF lung, but care should be taken in studies using sputum samples to ensure that samples are properly stored
Siri, What Should I Eat?
Zeevi et al. report that extensive monitoring of a human cohort for variations in dietary intake, lifestyle, host phenotype, and the gut microbiome has enabled the development of a machine-learning algorithm that accurately predicts the individual glycemic response to meals, providing an important first step toward personalized nutrition
Viewing the human microbiome through three-dimensional glasses: integrating structural and functional studies to better define the properties of myriad carbohydrate-active enzymes
Metagenomics has unleashed a deluge of sequencing data describing the organismal, genetic, and transcriptional diversity of the human microbiome. To better understand the precise functions of the myriad proteins encoded by the microbiome, including carbohydrate-active enzymes, it will be critical to combine structural studies with functional analyses
mockrobiota: a Public Resource for Microbiome Bioinformatics Benchmarking.
Mock communities are an important tool for validating, optimizing, and comparing bioinformatics methods for microbial community analysis. We present mockrobiota, a public resource for sharing, validating, and documenting mock community data resources, available at http://caporaso-lab.github.io/mockrobiota/. The materials contained in mockrobiota include data set and sample metadata, expected composition data (taxonomy or gene annotations or reference sequences for mock community members), and links to raw data (e.g., raw sequence data) for each mock community data set. mockrobiota does not supply physical sample materials directly, but the data set metadata included for each mock community indicate whether physical sample materials are available. At the time of this writing, mockrobiota contains 11 mock community data sets with known species compositions, including bacterial, archaeal, and eukaryotic mock communities, analyzed by high-throughput marker gene sequencing. IMPORTANCE The availability of standard and public mock community data will facilitate ongoing method optimizations, comparisons across studies that share source data, and greater transparency and access and eliminate redundancy. These are also valuable resources for bioinformatics teaching and training. This dynamic resource is intended to expand and evolve to meet the changing needs of the omics community
Recommended from our members
A widely distributed metalloenzyme class enables gut microbial metabolism of host- and diet-derived catechols.
Catechol dehydroxylation is a central chemical transformation in the gut microbial metabolism of plant- and host-derived small molecules. However, the molecular basis for this transformation and its distribution among gut microorganisms are poorly understood. Here, we characterize a molybdenum-dependent enzyme from the human gut bacterium Eggerthella lenta that dehydroxylates catecholamine neurotransmitters. Our findings suggest that this activity enables E. lenta to use dopamine as an electron acceptor. We also identify candidate dehydroxylases that metabolize additional host- and plant-derived catechols. These dehydroxylases belong to a distinct group of largely uncharacterized molybdenum-dependent enzymes that likely mediate primary and secondary metabolism in multiple environments. Finally, we observe catechol dehydroxylation in the gut microbiotas of diverse mammals, confirming the presence of this chemistry in habitats beyond the human gut. These results suggest that the chemical strategies that mediate metabolism and interactions in the human gut are relevant to a broad range of species and habitats
The mind-body-microbial continuum
Our understanding of the vast collection of microbes that live on and inside us (microbiota) and their collective genes (microbiome) has been revolutionized by culture-independent “metagenomic” techniques and DNA sequencing technologies. Most of our microbes live in our gut, where they function as a metabolic organ and provide attributes not encoded in our human genome. Metagenomic studies are revealing shared and distinctive features of microbial communities inhabiting different humans. A central question in psychiatry is the relative role of genes and environment in shaping behavior. The human microbiome serves as the interface between our genes and our history of environmental exposures; explorations of our microbiomes thus offer the possibility of providing new insights into our neurodevelopment and our behavioral phenotypes by affecting complex processes such as inter- and intra personal variations in cognition, personality, mood, sleep, and eating behavior, and perhaps even a variety of neuropsychiatric diseases ranging from affective disorders to autism. Better understanding of microbiome-encoded pathways for xenobiotic metabolism also has important implications for improving the efficacy of pharmacologic interventions with neuromodulator agents
Recommended from our members
Predicting and Manipulating Cardiac Drug Inactivation by the Human Gut Bacterium Eggerthella lenta
Despite numerous examples of the effects of the human gastrointestinal microbiome on drug efficacy and toxicity, there is often an incomplete understanding of the underlying mechanisms. Here, we dissect the inactivation of the cardiac drug digoxin by the gut Actinobacterium Eggerthella lenta. Transcriptional profiling, comparative genomics, and culture-based assays revealed a cytochrome-encoding operon up-regulated by digoxin, inhibited by arginine, absent in nonmetabolizing E. lenta strains, and predictive of digoxin inactivation by the human gut microbiome. Pharmacokinetic studies using gnotobiotic mice revealed that dietary protein reduces the in vivo microbial metabolism of digoxin, with significant changes to drug concentration in the serum and urine. These results emphasize the importance of viewing pharmacology from the perspective of both our human and microbial genomes.Chemistry and Chemical Biolog
Recommended from our members
High-resolution microbial community reconstruction by integrating short reads from multiple 16S rRNA regions
The emergence of massively parallel sequencing technology has revolutionized microbial profiling, allowing the unprecedented comparison of microbial diversity across time and space in a wide range of host-associated and environmental ecosystems. Although the high-throughput nature of such methods enables the detection of low-frequency bacteria, these advances come at the cost of sequencing read length, limiting the phylogenetic resolution possible by current methods. Here, we present a generic approach for integrating short reads from large genomic regions, thus enabling phylogenetic resolution far exceeding current methods. The approach is based on a mapping to a statistical model that is later solved as a constrained optimization problem. We demonstrate the utility of this method by analyzing human saliva and Drosophila samples, using Illumina single-end sequencing of a 750 bp amplicon of the 16S rRNA gene. Phylogenetic resolution is significantly extended while reducing the number of falsely detected bacteria, as compared with standard single-region Roche 454 Pyrosequencing. Our approach can be seamlessly applied to simultaneous sequencing of multiple genes providing a higher resolution view of the composition and activity of complex microbial communities
- …